Claims

1. Method for pre-processing speech, in particular in a method for recognizing
speech, comprising the steps of:

- receiving a speech signal (S),

- separating a spectrum (F) of said speech signal (S) into a given number (N)
of predetermined frequency sub-bands (F_1, ..., F_N),

- analyzing said speech signal (S) within each of said frequency sub-bands
(F_1, ..., F_N),

- thereby generating respective band-dependent acoustic feature data
(O_1, ..., O_n) for each of said respective frequency sub-bands (F_1, ..., F_N),
which band-dependent acoustic feature data (O_1, ..., O_n) are at least in
part representative of said speech signal (S) with respect to a respective
frequency sub-band (F_1, ..., F_N),

- deriving band-dependent likelihoods (b_1, ..., b_N) for occurrences of speech
elements (P1, ..., Pm) or of sequences thereof within said speech signal (S)
based on said band-dependent acoustic feature data (O_1, ..., O_n) and/or a
derivative thereof,

- analyzing said speech signal (S) within said entire spectrum (F),

- thereby generating full-band acoustic feature data (FBE-F; FFBE;
FBE-F-SSUB; O_f,SSUB), which are at least in part representative of said
speech signal (S) with respect to said entire spectrum (F),

- deriving a full-band likelihood (B_FF; B_SSUB) for occurrences of speech
elements (P1, ..., Pm) or of sequences thereof within said speech signal (S)
based on said full-band acoustic feature data (FBE-F; FFBE; FBE-F-SSUB;
O_f,SSUB) and/or a derivative thereof,

- deriving an overall likelihood (B) for occurrences of speech elements
(P1, ..., Pm) or of sequences thereof within said speech signal (S) based on
said band-dependent likelihoods (b_1, ..., b_N) and said full-band likelihood
(B_FF; B_SSUB).
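By way of illustration only, the sub-band separation step of claim 1 might be sketched in Python as follows. The claim requires only that the sub-bands be predetermined; the contiguous, near-equal-width split used here, the function name split_into_subbands, and the representation of the spectrum as a list of bins are assumptions made for this sketch.

```python
def split_into_subbands(spectrum, n_bands):
    """Split a list of spectral bins into n_bands contiguous sub-bands.

    Assumption: the sub-bands are contiguous and as equal in width as
    possible. The claim itself only requires predetermined sub-bands.
    """
    if n_bands < 1 or n_bands > len(spectrum):
        raise ValueError("n_bands must be between 1 and len(spectrum)")
    base, extra = divmod(len(spectrum), n_bands)
    bands = []
    start = 0
    for k in range(n_bands):
        # The first `extra` bands receive one additional bin each.
        size = base + (1 if k < extra else 0)
        bands.append(spectrum[start:start + size])
        start += size
    return bands
```

Each returned sub-band would then be analyzed separately to obtain its band-dependent acoustic feature data, while the unsplit spectrum is analyzed for the full-band feature data.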

2. The method according to claim 1, 
characterized in that 

when deriving said overall likelihood (B) said band-dependent likelihoods
(b_1, ..., b_N) are combined to a union model likelihood (B_U,MFCC) by
determining the number of uncorrupted frequency sub-bands of said frequency
sub-bands (F_1, ..., F_N), and adding all possible combinations of products of
different band-dependent likelihoods (b_1, ..., b_N) corresponding to
respective frequency sub-bands.
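The combination described in claim 2 can be illustrated with a short Python sketch. It assumes that the number of uncorrupted sub-bands is already known and that each product is taken over exactly that many different band-dependent likelihoods; any normalization constant is omitted, and the function name union_model_likelihood is hypothetical.

```python
from itertools import combinations
from math import prod


def union_model_likelihood(band_likelihoods, num_uncorrupted):
    """Sum the products over all combinations of num_uncorrupted bands.

    Assumption: the number of uncorrupted sub-bands has been determined
    beforehand; no normalization is applied to the resulting sum.
    """
    n = len(band_likelihoods)
    if not 0 <= num_uncorrupted <= n:
        raise ValueError("num_uncorrupted must lie between 0 and N")
    return sum(prod(c) for c in combinations(band_likelihoods, num_uncorrupted))
```

For three sub-bands with likelihoods 0.5, 0.2 and 0.1 and two sub-bands taken as uncorrupted, the sketch adds 0.5*0.2, 0.5*0.1 and 0.2*0.1. With all three sub-bands uncorrupted it reduces to the plain product of the three likelihoods.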

3. The method according to any one of the preceding claims, 
characterized in that 

said band-dependent acoustic feature data (O_1, ..., O_n) comprise respective
band-dependent mel-frequency cepstral coefficient features, which are based
on mel-frequency cepstral coefficients and/or a derivative thereof derived
from respective frequency sub-bands (F_1, ..., F_N).
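As a hedged illustration of claim 3, cepstral coefficients for one sub-band may be computed by taking a discrete cosine transform (DCT-II, unnormalized) of the logarithm of that sub-band's filter bank energies. The sketch assumes that strictly positive mel-spaced filter bank energies for the sub-band are already available; the function name subband_cepstrum and its parameters are hypothetical.

```python
import math


def subband_cepstrum(band_energies, n_coeffs):
    """Return n_coeffs cepstral coefficients for one frequency sub-band.

    Assumption: band_energies holds strictly positive mel filter bank
    energies of this sub-band; an unnormalized DCT-II is used.
    """
    m = len(band_energies)
    if not 1 <= n_coeffs <= m:
        raise ValueError("n_coeffs must lie between 1 and len(band_energies)")
    log_e = [math.log(e) for e in band_energies]
    return [
        sum(log_e[j] * math.cos(math.pi * k * (j + 0.5) / m) for j in range(m))
        for k in range(n_coeffs)
    ]
```

Applying the function to each sub-band in turn would yield one band-dependent feature vector per sub-band.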

4. The method according to any one of the preceding claims, 
characterized in that 

a predetermined broadband noise robustness technique is applied prior to 
deriving said full-band likelihood term (B_FF; B_SSUB).

5. The method according to claim 4, 
characterized in that 

said broadband noise robustness technique is based on a frequency-filtering 
technique. 

6. The method according to claim 4, 
characterized in that 

said broadband noise robustness technique is based on a method of spectral
subtraction.
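For illustration, one common form of spectral subtraction removes a noise power estimate from each spectral bin and floors the result so that it stays positive. The claims do not specify the variant used; the over-subtraction factor, the spectral floor, and the function name spectral_subtraction below are assumptions made for this sketch.

```python
def spectral_subtraction(noisy_power, noise_power, over_subtraction=1.0, floor=0.01):
    """Subtract a noise power estimate from a noisy power spectrum.

    Assumptions: both inputs are per-bin power values of equal length;
    each result is floored at floor * noisy power to avoid negative values.
    """
    if len(noisy_power) != len(noise_power):
        raise ValueError("spectra must have the same number of bins")
    return [
        max(y - over_subtraction * n, floor * y)
        for y, n in zip(noisy_power, noise_power)
    ]
```

The noise power estimate would typically be obtained from non-speech segments; how that estimate is produced is outside the scope of this sketch.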

7. The method according to any one of the preceding claims, 
characterized in that 

said full-band acoustic feature data (FBE-F; FFBE; FBE-F-SSUB; O_f,SSUB)
comprise filter bank energy features (FBE-F), which are based on filter bank 
energies derived from said entire spectrum (F). 

8. The method according to any one of the preceding claims, 
characterized in that 

said full-band acoustic feature data (FBE-F; FFBE; FBE-F-SSUB; O_f,SSUB)
comprise filtered filter bank energy features (FFBE), which are based on
filtered filter bank energies derived from said entire spectrum (F).

9. The method according to any one of the preceding claims, 
characterized in that

said full-band acoustic feature data (FBE-F; FFBE; FBE-F-SSUB; O_f,SSUB)
comprise full-band mel-frequency cepstral coefficient features, which are
based on mel-frequency cepstral coefficients and/or a derivative thereof
derived from said entire spectrum (F).

10. The method according to any one of the preceding claims, 
characterized in that 

said full-band acoustic feature data (FBE-F; FFBE; FBE-F-SSUB; O_f,SSUB)
and/or said band-dependent acoustic feature data (O_1, ..., O_n) comprise
PLP-linear prediction filter features, which are based on PLP-linear
prediction filter coefficients.

11. The method according to any one of the preceding claims, 
characterized in that

said full-band acoustic feature data (FBE-F; FFBE; FBE-F-SSUB; O_f,SSUB)
comprise spectrally-changed full-band mel-frequency cepstral coefficient
features (O_f,SSUB), which are generated by applying a method of spectral
subtraction to said full-band mel-frequency cepstral coefficient features (O_f).

12. The method according to any one of the preceding claims, 
characterized in that 

said band-dependent likelihoods (b_1, ..., b_N) and said likelihood term
(B_FF; B_SSUB; B_U,FF) are determined using a probability estimator.

13. The method according to any one of the preceding claims, 
characterized in that 

said filtered filter bank energies (FFBE) are derived from said filter bank
energies (FBE) by subtracting (f(i) = f(i+1) - f(i-1)) a first filter bank
energy (FBE_i-1) from a second filter bank energy (FBE_i+1), wherein said
first filter bank energy (FBE_i-1) corresponds to a first discrete frequency
and said second filter bank energy (FBE_i+1) corresponds to a second discrete
frequency, lying two discrete frequency steps after said first filter bank
energy (FBE_i-1).
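The subtraction recited in claim 13, in which the filter bank energy at index i-1 is subtracted from the filter bank energy at index i+1, can be sketched in Python as follows. The claim does not state how the first and last index are handled; treating out-of-range energies as zero is an assumption of this sketch, as is the function name filtered_filter_bank_energies.

```python
def filtered_filter_bank_energies(fbe):
    """Return the energy at i+1 minus the energy at i-1 for every index i.

    Assumption: energies outside the valid index range are taken as zero,
    so the output has the same length as the input.
    """
    m = len(fbe)
    filtered = []
    for i in range(m):
        nxt = fbe[i + 1] if i + 1 < m else 0.0
        prev = fbe[i - 1] if i - 1 >= 0 else 0.0
        filtered.append(nxt - prev)
    return filtered
```

For the input [1.0, 4.0, 9.0, 16.0] the interior values are 9.0 - 1.0 and 16.0 - 4.0, and the two edge values depend on the zero-padding assumption.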




14. Speech pre-processing system, in particular integrated into a speech
processing system, which is capable of performing or realizing a method for
pre-processing speech according to any one of the preceding claims 1 to 13
and/or the steps thereof.

15. Computer program product, 

comprising computer program means adapted to perform and/or to realize
the method of pre-processing speech according to any one of the claims 1 to
13 and/or the steps thereof, when it is executed on a computer, a digital
signal processing means, and/or the like.

16. Computer readable storage medium, 

comprising a computer program product according to claim 15. 



